Multi-Channel Speech Enhancement using a Minimum Variance Distortionless Response Beamformer based on Graph Convolutional Network
نویسندگان
چکیده
The Minimum Variance Distortionless Response (MVDR) beamforming algorithm is frequently utilized to extract speech and noise from noisy signals captured multiple microphones. A frequency-time mask should be employed compute the Power Spectral Density (PSD) matrices of signal interest obtain optimal weights for beamformer. Deep Neural Networks (DNNs) are widely used estimating time-frequency masks. This paper adopts a novel method using Graph Convolutional (GCNs) learn spatial correlations among different channels. GCNs integrated into embedding space U-Net architecture estimate Complex Ideal Ratio Mask (cIRM). We use cIRM in an MVDR beamformer further improve enhancement system. simulate room acoustics data experiment extensively with our approach types microphone array. Results indicate superiority when compared current state-of-the-art methods. metrics obtained by proposed significantly improved, except Scale-Invariant Source-to-Distortion (SI-SDR) score. Perceptual Evaluation Speech Quality (PESQ) score shows noticeable improvement over baseline models (i.e., 2.207 vs. 2.104 2.076). Our implementation can found following link: https://github.com/3i-hust-asr/gnn-mvdr-final.
منابع مشابه
Minimum variance distortionless response (MVDR) modeling of voiced speech
In this paper we propose the MVDR method, which is based upon the Minimum Variance Distortionless Response (MVDR) spectrum estimation method, for modeling voiced speech. Developed to overcome some of the shortcomings of Linear Prediction models, the MVDR method provides better models for medium and high pitch voiced speech. The MVDR model is an all-pole model whose spectrum is easily obtained f...
متن کاملA minimum variance distortionless response filter based on the bifrequency spectrum for single-channel noise reduction
Article history: Available online 1 July 2014
متن کاملMinimum Variance Distortionless Response Beamformer with Enhanced Nulling Level Control via Dynamic Mutated Artificial Immune System
In smart antenna applications, the adaptive beamforming technique is used to cancel interfering signals (placing nulls) and produce or steer a strong beam toward the target signal according to the calculated weight vectors. Minimum variance distortionless response (MVDR) beamforming is capable of determining the weight vectors for beam steering; however, its nulling level on the interference so...
متن کاملMinimum variance distortionless response on a warped frequency scale
In this work we propose a time domain technique to estimate an all-pole model based on the minimum variance distortionless response (MVDR) using a warped short time frequency axis such as the Mel scale. The use of the MVDR eliminates the overemphasis of harmonic peaks typically seen in medium and high pitched voiced speech when spectral estimation is based on linear prediction (LP). Moreover, w...
متن کاملRegularized minimum variance distortionless response-based cepstral features for robust continuous speech recognition
In this paper, we present robust feature extractors that incorporate a regularized minimum variance distortionless response (RMVDR) spectrum estimator instead of the discrete Fourier transform-based direct spectrum estimator, used in many front-ends including the conventional MFCC, to estimate the speech power spectrum. Direct spectrum estimators, e.g., single tapered periodogram, have high var...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: International Journal of Advanced Computer Science and Applications
سال: 2022
ISSN: ['2158-107X', '2156-5570']
DOI: https://doi.org/10.14569/ijacsa.2022.0131088